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Abstract 

Based on the probability distributions of products of random variables, we pro- 
pose a simple stringy mechanism that prefers the meta-stable vacua with a small 
cosmological constant. We state some relevant properties of the probability distribu- 
tions of functions of random variables. We then illustrate the mechanism within the 
flux compactification models in Type IIB string theory. As a result of the stringy 
dynamics, we argue that the generic probability distribution for the meta-stable 
vacua typically peaks with a divergent behavior at the zero value of the cosmologi- 
cal constant. However, its suppression in the single modulus model studied here is 
modest. 
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1 Introduction 

Recent cosmological data strongly suggests that our universe has been sitting at a vacuum state 
with an exponentially small positive vacuum energy density, or cosmological constant A. The 
argument of Bousso and Polchinski |1;| (see also reviews [2j|3]) suggests that string theory has so 
many possible vacua solutions that, in principle, it can have solutions that are consistent with 
such a very small positive A. However, their argument leaves open the most pressing question 
why string theory prefers a vacuum solution with such a very small A. Here we like to propose a 
plausible reason why this may happen within the context of string theory. The basic underlying 
physics is very simple. After pointing out the conditions under which a stringy solution with 
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Fi gure 1: Some examples of the product distribution of z where each Xj has an uniform ([-1,1]) distribution 
(on the left hand side) or a normal (Gaussian) distribution with variance <jj = 1 (on the right hand side). The 
product distribution P(z) is for n = 1 (z = x±, solid brown curve), n — 2 (z = X\x 2 , red dashed curve), and 
n = 3 (z = X1X2X3, blue dotted curve), respectively. The curves on the left are given in (1.1 1. The curves on 
the right are given by the Meijer-G function (2.11 ), which reduces to the modified Bessel function of the second 
kind for n = 2. 



a small A may be preferred, we illustrate with some examples in flux compactification in Type 
IIB theory. 

The key idea follows from some simple well known properties of probability distributions 
for products of random variables. Consider a set of random variables Xj (j = 1, ■ ■ ■ ,n). As an 
example, if each random variable Xj has a uniform probability distribution between [—1, 1], then 
the probability distribution P{z) for their product is given by 

for z = X1X2 ■ ■ ■ x n , where — 1 < z < 1. Note that P(z) peaks and in fact diverges at z — 0. 
This peaking property is insensitive to the particular probability distributions of Xj as long 
as each distribution smoothly includes zero. For example, if each Xj has a normal (Gaussian) 
distribution, then P(z) still diverges like ln(l/|2;|) n-1 at z = 0. For z = x™, P(z) diverges 
like as \z\ — » 0. The probabilty distribution for Xj and the product distributions for 

z = X1X2 and z = xyx^Xj, are illustrated in Figure [TJ 

To obtain a low energy effective theory in string theory, we have to compactify and stabilize 
the extra dimensions (see e.g. [2]). A compactification is described by the compactified manifold's 
shape and sizes, which collectively are referred to as moduli. Different sets of moduli may be 
continuously morphed into each other as the geometry changes. The cosmic stringy landscape is 
described by these moduli. A specific flux compactification can be achieved via the introduction 
of a dozen or more types of 4- form quantized fluxes jT|, whose discrete values can easily range 
to many millions. This feature translates to moduli that can take numerous values. At low 
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energies, the landscape may be described by an effective potential V, which is a function of all 
the light moduli. To simplify the analysis, we assume that most of the moduli except a few have 
already been stabilized at some high scales. They may be integrated out, so these stabilized 
moduli are now replaced by a set of parameters in the low energy effective potential V. Now, 
some of these parameters may take numerous values. In sampling the landscape, we may treat 
some of the parameters as random variables taking values that smoothly include zero. 

In this paper, we study a particularly simple example in Type IIB flux compactification, 
namely, the single modulus model in the Kahler uplifting scenario [4jj7] in the large volume 
approximation M. Consider the superpotential W = W$—Ae~ x , where W$ and A are parameters 
and x is a Kahler modulus. In the large volume approximation, the potential V(x) may be 
reduced to 2 terms only. We are interested in a meta-stable vacuum (which may quantum 
tunnel to another vacuum state). Such a stable vacuum can exist at x — x m . If so, A at this 
stable vacuum (in a simplified version) turns out to have the form 

A ~ BW A{x m - x ) (1.2) 

where B is a constant. In this particular model Xq = 5/2. Depending on the input parameters 
and the Kahler potential, we may tune (x m — Xq) to zero or to an exponentially small value. 
However, this is fine tuning. On the other hand, let us treat Wq > and A > as random 
parameters. Then, intuitively at least, the resulting probability distribution P(A) for A will 
naturally peak at A = 0. That is, P(A) in general diverges at A = 0. 

Note that the above feature is where string theory differs from conventional field theory. In 
field theory, or supergravity, the parameters in V take fixed values. So there is no distribution 
of A to talk about, and the above peaking behavior is absent. It is the string landscape that 
provides the reason why some of the parameters may be allowed to take multiple values R 



We shall make some plausible inputs so some of the parameters that enter into (1.2) can take 
values that include zero. However, the specific peaking behavior is different with different 
plausible inputs, resulting in different levels of suppression of the expected value for A. So we 
see intuitively that the peaking behavior at A = is a stringy feature. It is the result of high 
scale string dynamics in flux compactification that is believed to be present. 

Although the above peaking behavior captures some of the stringy features, it alone does 



not tell the whole story. The average magnitude of the naive A (1.2) simply follows from the 
product of the average magnitudes of the parameters emerging from string theory. In string 
theory, the actual A is the value of a meta-stable minimum of V. The extremum (dkV = 0) 
and the stability (positivity of the Hessian matrix dj-diV) conditions impose a constraint on the 
ratio Wo/ A. Apriori, this type of constraints may erase the peaking behavior coming from the 

: Of course one can endow a supergravity model with a collection of quantized 4-form fluxes and branes. We 
consider such models as string-motivated and the mechanism there as stringy as well. 
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above intuitive argument. In this case, if Wq, A and (x m — xq) smoothly include zero, we expect 
-P(A) to diverge like (In A) 2 (at A = 0). However, as we shall see, the constraint cuts the power 
by one, to P(A) ~ ln(l/A). Treating other parameters in the model as random, P(A) can even 
have a power-like divergence at A = 0. However, with uniform distributions for the random 
parameters, the mechanism produces only a modest suppression of A. On the other hand, even 
with only 2 random parameters Wq and A, they themselves may be non-trivial functions of the 
other stabilized moduli (at some higher scales) in flux compactification, so their distributions 
may be peaked already If the probability distributions of Wq and A are themselves peaked at 
zero (which is very likely if we look at how they arise f) then P(A) will be more sharply peaked 
at zero. As the peak becomes sharper, a smaller A becomes more likely. 

As we just explained, the actual stringy mechanism for a small A in the single modulus 
model in the Kahler uplifting scenario [4jj6] is a little involved due to the constraints. Under 
plausible conditions, we see that the peaking behavior for A is modified but not erased by the 
constraints. Furthermore, the constraints tend to limit the ranges of the random parameters, 
so the resulting A can be substantially smaller than the naive A. Even in such a very simple 
model, we already see that the preferred A can be smaller than the typical values suggested 
by the string/Planck scale. This reduction in the expected value of A is the output of explicit 
moduli stabilization within string dynamics. We believe that the preference for a small A in 
string theory is a generic feature of the string dynamics in flux compactification. 

In a scenario with a single modulus 0, the presence of a 0-independent term in V typically 
erases the above peaking feature. That is, when a term that peaks at zero is added to an 
independent term, the peaking feature is usually erased. More generally, if any subset of moduli 
does not couple to the rest of the moduli (the above example has an empty subset), the above 
peaking behavior will generically be erased. So we simply assert that such couplings among 
the moduli are necessary for the peaking phenomenon for a small A. In scenarios with more 
than one modulus, it is important that they are coupled to each other, directly or indirectly. 
Otherwise, A ends up to be a sum of terms, typically erasing the peaking phenomenon. It is the 
extremum and the stability conditions that link the various terms in V together. In this sense, 
the stringy dynamics is very important. 

The smallness of Wo, A also leads to a small modulus mass m 2 ~ BWqA in Planck unit. The 
cosmological moduli problem [9 10 constrains the mass of moduli fields to be > O(10) TeV, 
which leads to (with peaked distributions for Wq and A), A > 1CT 30 (in Planck unit) in (1.2). 
Here we see no particular reason (without fine tuning) why the factor (x m — xq) in (1.2) should 
be exponentially small. However, if the corresponding (to (x m — xq)) factor in a more non-trivial 
multi-moduli scenario can naturally be exponentially small, then the moduli masses may still 

2 Although non-trivial solutions allow Wq, A to take ranges of values that include zero, it is probably natural 
to simply have Wo = A = 0, yielding supersymmetric Minkowski solutions. This suggests that the probability 
distributions of Wqi^I naturally peak at zero values. 
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be very small, but they may no longer be closely tied to the value of A. It remains an open 
question whether (and under what conditions) this mechanism can naturally produce a A as 
small as the one observed. It is plausible that there is a collection of mechanisms pushing A 
down and the above mechanism is simply one of them. 

Although the above examples suggest that small As are preferred, they also seem to imply 
equal (or comparable) probabilities for both positive and negative values of A. For the preference 
of a positive value, we may invoke the history of our universe. Cosmological data suggests that 
our universe has gone through an inflationary epoch in its very early stage. So our universe most 
probably started with a relatively large positive vacuum energy density during the inflationary 
epoch; it subsequently moved down along a particular path in the "cosmic landscape" and 
eventually landed at the present small value. Since the universe moved down from a positive 
large A, it passes the small positive A region before reaching the negative A region, so it is 
reasonable that our universe reached a meta-stable vacuum with a small positive A before it has 
a chance to move further down to a negative A vacuum. 

For a random variable that takes discrete values only, we shall assume that the spacings of 
the discrete values are small enough for us to ignore this fine point. In the evaluation of actual 
A in a realistic model, we must make sure that the very small observed dark energy value can be 
reached. Invoking the Bousso-Polchinski argument, we believe this is a valid assumption. Since 
the model we are considering here is still rather crude, we shall not worry too much about this 
important issue in this paper. We do discuss the discreteness issue when it is directly relevant 
to the model we study. 

The rest of the paper is organized as follows. We review and discuss the probability distribu- 
tion of products (also ratios and sums) of random variables in Section [2j in preparation for their 
applications to examples in string theory. The key results are summarized in Table [TJ In Section 
[3j we show how a preference for a small A does not emerge in some toy models. In Section |4j 
we discuss in some detail a Type IIB flux compactification model in the large volume scenario. 
This model was studied analytically recently in |6] (see also |5j[TT]). We present and discuss 
the background necessary for our analysis. In Section |5j we take the viewpoint explained above 
and see how the stringy mechanism for a small A works in a single modulus case. In particular, 
we discuss how the peaking behavior emerges. We consider different plausible scenarios and 
summarize the key features in Table [2| We end with some discussions and remarks in Section 
[6j Some details are relegated to Appendix |Aj 
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2 Distributions for products, ratios and sums of random 
variables 



Let us review and discuss the probability distributions of various combinations of random vari- 
ables, including the probability distribution of the product of random variables, namely, the 
Product Distribution, which will be a key ingredient governing the population behavior of the 
effective potential extrema for moduli stabilization models in string theory. Even if the distri- 
butions of random variables are simple, a product of these variables can become non-trivial. In 
anticipation to our application in a flux compactification model in string theory, we also discuss 
the probability distributions for sums and ratios of random variables. The key results are easily 
derived and they are summarized in Table [TJ 

To begin, we like to specify the statistical properties of the random variables Xj. We shall 
treat each independent, identically distributed (i.i.d.) variable drawn from some sta- 

tistical distribution Qj. Different Xj can have different Qj. For our discussion here, the precise 
choice of flj is not important, provided that the higher moments of Qj are appropriately bounded. 
(These higher moments are essentially the Mellin transforms that will play an important role 
here.) However, it is important whether the zero value is inside flj or not. By definition, all 
distributions for Xj and yi include zero smoothly. For random variables that excludes zero, we 
shall call them Ck, with statistical distribution Clk. For a random variable that takes discrete 
values only, we shall simply assume that the spacings of the discrete values are small enough for 
us to ignore this fine point. 



2.1 Product distribution 

The product distribution may be derived analytically if the original distribution is also known 
analytically. For instance, if the distribution of each random variable Xi obeys uniform distribu- 
tion changing between and 1, the properly normalized product distribution of z = X\X 2 • • • x n 



is obtained by integrating out a delta-function 12 



f 1 1 / l\ n_1 

P(z) = dx\dx 2 ■ ■ ■ dx n 8(xxx 2 ■ ■ -x n - z) = — In- . (2.1) 

Jo (n-1)! V zj 

for < z < 1. If each Xi obeys the uniform distribution in the range [—1, 1], then P(z) is given 



in (jLlJ for -1 < z < 1. 

Generically, the product distribution can be calculated via the introduction of the Mellin 
integral transform of the function f(w) for w > 0, 



oo 

s-1 



M{f(w)\s}= / w s - 1 f(w)dw (2.2) 
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where, for real s, the Mellin transform M{f(w)\s} is simply the (s — l)-moment (u> s_1 ) of the 
distribution function f(w). As an example, let us consider the probability distribution of the 
product of two random variables each with an independent distribution. Suppose each of the 
two semi-positively defined random variables x±, x 2 obeys an independent distribution, satisfying 
fo° fi( x i) dxi = 1, / °° 72(^2) dx 2 — In Then the product distribution of these two random 
variables becomes fl3l 



P(z) 



fi( x i) 72(^2) S(x x x 2 - z) dx 1 dx 2 



Jo 

00 



(2.3) 



fx ( — ) f2{x 2 )dx 2 . 

x 2 \ x 2, 



When we multiply z s and integrate over z, we obtain 14 : 



M{P(z)\s} = Af{/!(xi)|s} • M{f 2 (x 2 )\s}. 



(2.4) 



Therefore the product distribution can be written in term of inverse Mellin integral transforms 

by 



P{z) 



1 



Z - s M{P{z)\s}ds 



(2.5) 



2vri JL 

where the line integral is performed in the complex s plane for z > 0. This easily generalizes 
to the case of the product of n semi-positive random variables Xj each with its own distribution 
fj(xj), z — Xi ■ ■ ■ x n , where 

n 

M{P(z)\s} = l[M{f^)\s} (2.6) 



Following this and (2.2), we see that the expectation value (z p ) = M{P(z)\p + 1} is given by 

n 

(^)=n<4> ( 2 - 7 ) 

3=1 

Using the Mellin integral transformation, we can find the product distribution for normal 
distribution, P nor mai( x i) = h{ x i > 0) = ■sj (2/%af)e~ x ^ 2a i lij 16 . The resulting distribution 
goes as 



P( z ) =N^Gol I : 



N =(2nr/ 2 l[a t 

i=i 



n 



0,0,... ,0 



(2.8) 



3 Although we focus on just semi-positively denned random variables (that smoothly includes zero) for sim- 
plicity, the argument can easily be applied to the entire region. 



7 



where the Meijer G-function obeys the following differential equation 
p / , \ i 



1) 



p—m—n 



W 



n 



w - 



d 

dw 



dj + 1 



n 



w- 



a 1; 



which can also be expressed in the integral form: 



Gp i w 



ai, 
&i, 



f 

2tu 



w 



n 3 m =i r ( a + 6 j)-n" = ir(i- 



q Yd 

j=m+l 1 I 1 



6j-s 



= 0, 
(2.9) 

-ds, (2.10) 



where the line integral is performed in the complex plane. Note that the integration of the 
Meijer G-function is an inverse Mellin transform. Then, the set of Meijer G-function relevant 
for us becomes 

i=i 



„n 

GOn I ffi 



0,0, 







(2.11) 



When n = 2, this function is simply the modified Bessel function of the second kind K (y/w) = 
Ko(z/Y]™ =1 v2(7j). The cases for n = 1,2,3 are shown in Figure 1 

Finally, the distribution for a product of the same random variable itself (z = x n ) can be 
obtained by equating J °° P{z)dz = J °° f{x)dx with a distribution /(a;) for the random variable 
x. The resulting power distribution becomes 

z -l+l/n 



P z) 



n 



(2.12) 



Note that a special case with n = 2 is known as a part of ^-distribution, x 2 (l)- It is a l so worth 
noting that the formula applies even for a non-integral n. 



2.1.1 Asymptotic behavior around z = 



For both the uniform and the normal distributions, we show that the asymptotic behaviors of 
the product distributions have the same divergent behaviors at z — 0. Since the asymptotic 



behavior of the product distribution (2.1) for uniform distribution is explicit, let us elaborate 



on the asymptotic behaviors of the more non-trivial situations. 



The asymptotic behavior of product distribution (2.8 ) for normal distribution can be obtained 
by expanding the Meijer G-function around z = 0: 



P(z)=N^Go°n , 2 f[^ 

\ i=l 1 



0, 0, • • • , I ~° 



1 



2(n-i)inr=i^ w 



ra/2 



In 



n-l 



+ 



(2.13) 



Therefore the product of independent random variables diverges as powers of log functions even 
though the original individual distributions are smooth at zero. Up to the normalization factor, 
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the divergent behavior at z = is the same as that for the uniform distributions ( 2.8[ ). This 
property is intuitively clear. So it follows that if P(yi) ~ [ln(l/?/i)] ni and Pfa) ~ [hi(l/l/2)] n2 , 
then the probability distribution of z — y±y2 > goes like, as z — > 0, 



P(z) ~ IHl/z)} 



ni +712+1 



However, the diverging behavior changes once some random variables are raised to some 



powers. As we see immediately from (2.12), the diverging behavior appears as negative powers 
of z if the original distribution is smoothly finite at z — 0. To be specific, let us find out what 
happens in the case of the product o where both x\ and X2 have the same uniform 

distribution from to 1, as an example. The distributions for the power of each parameter are 



given in (2.12) with f(xi) = 1. The resulting distribution can be calculated by 

,1 ^1 — l+l/m -1+1/n 

m n (2U) 



P(z) = / / — 5(wiW2 — z) dwidw2 



o jo 
1 



m — n 



f z -l+l/m _ z -l+l/n\ 



This formula also applies at m = n. At this limit, the function goes like z- 1+l / n \n(l/z) as 
which is slightly more diverging than that for just a simple power of a single random 
variable. Repeating this calculation, we can find the probability distribution for z = x™ 1 • • • x^ m , 
the m-product of arbitral powers of x^ with each Xi obeying the same uniform distribution 
from to 1, 



n m ~ 2 



P(z) = T fr -^ -z~ 1+1 ^. (2.15) 

When we set rii = n for all n^, the function goes like z~ l+l l n {hiz) m ~ 1 . We expect that the same 
divergence is obtained in the case of normal distribution. 

Let us summarize the divergent behaviors of the various product distributions in Table [T] 
The product distributions are in general divergent at z = 0. The divergence becomes powers if 
there exists a product of the same random variable inside z, which is more divergent at z = 
than the logs. It is worth noting that the power behavior must be less divergent than z~ l , which 
is necessary for P(z) to be properly normalized. 



2.1.2 Divergent behavior versus peaking behavior 

In general, a distribution is more peaked at z = if it is more divergent there, so the divergent 
behavior at z = provides a very useful measure of the peakiness (or the peaking behavior) of 
the distribution. However, they are not equivalent. The divergent behavior is mathematically 
defined independent of the details while the peakiness does depend on the actual distributions of 
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z 


Asymptote of P(z) at 2 = 






x x 


z 


x l " ' ' x m 






1+1/m r -l+l/n\ //_. „\ 
yz ' — z 1 j/yiii — ti) 


xi x m / y\ - ■ -y n 




x i /Z/i 


,,-1+1/m 


x\ 1 + ■ ■ ■ + x" m 


^— l/n m 


X1X2, < c = X\jxi < 00 


smooth 


X1X2, < c = x\jxi or c < 00 


MVM) 



Table 1: The asymptotic behaviors of the probability distributions P(z) of the product, ratio and sum of 
random variables Xj and yj which smoothly include zero. The product distributions have divergent behaviors 
at z = in general. The divergent behavior of ratio distribution is simply that given by that of the numerator. 
The sum distribution works toward erasing the divergent behavior of each term. If there is a constraint among 
some random variables, the divergent behavior of product distribution may be erased. 



the random variables. In general, the peakiness becomes stronger for more powers of the random 
variables. Instead of providing a definition for the peakiness (we shall introduce a measure on 
the peakiness suitable for the A problem at hand in Section [5]), let us illustrate with an example. 
For simplicity, let us compare the distribution of z — x\ and z = x\x 2 - Assuming uniform 
distribution from to 1 for x\ and X2, the product distributions are obtained by (0 < z < 1) 

P(z = x\) = P(z = x\x 2 ) = z- 1 ' 2 - 1. (2.16) 

Although the distributions for both z = x\ and have the same z ' divergent behavior 

at z = 0, the distribution for z = x\x 2 is more peaked there, as we see in Figure [2] We see that 
the peakiness depends on both the normalization of the divergent term as well as on the tail 
behavior (values away from z = 0). 

As another example, let us calculate the expectation values of the following two cases, as- 
suming uniform distributions for all Xj > 0, 

dz —(in- I = — , 

(n-1) V z 2 n ' 

v ' v ' 2.17 

1 z -l+l/n 1 

dz z = 

o n n + 1 




where the first line follows from (2.7). Although the product distribution has only a log di- 
vergence versus the ^ _1+1 / n power divergence of the power distribution, we see that P(z) for 
z = niLi x i i s more peaked than that for z = x n . In general, we expect more population in 
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Figure 2: A comparison of the peakiness (or peaking behavior). The left hand side shows the product distribu- 
tions for z while the right hand side shows cumulative probability distribution beginning from z = 0. The curves 
are for z = X\ (solid horizontal line), z = x\ (red dashed curve), and (blue dotted curve) in each plot, 

assuming uniform distributions for x\ and X2- ■ 

smaller z if z involves more random variables rather than more powers of the same random 
variable. 

If each uniform distribution has a lower bound e such that e < Xj < 1, the expectation value 



becomes (using (2.7)) 



]>A=^(i+6r. ( 2 - 18 ) 

,i=l / 

Therefore, if e is small enough, the expectation value, which may be treated as a measure of 



the peakiness, is roughly the same as that given in (2.17); however, the probability distribution 



P{z) no longer diverges. Instead, P{z) ~ [ln(l/;2)] n 1 peaks at z ~ e and then drops to zero at 



2.2 Ratio distribution 

Next, we describe what happens if random variables come in denominator of z. First example 
we consider here is a combined variable given by z = x™/y™. We consider the parameters x\, y\ 
obeying uniform distribution in range from to 1. The distributions of the numerator w\ = x™ 



and the denominator w 2 = can be calculated independently following (2.12). Therefore, the 
distribution of z is given by 

P(z) =J o J o P{wi = O P(w 2 = yl) 5 - z\ d Wl dw 2 

_J_ -l+l/m f q < < j ( 2 - 19 ) 
m+n — — ' 

_A^ z -l-l/n f 1 <• z _ 
m+n — 
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The divergent behavior at z = is controlled by the divergence of the numerator, as is intuitively 
clear. 

Next we consider the other combination given by z = Xl "' Xm where all variables obey uniform 
distribution in range from to 1. The distribution of each numerator W\ = x\---x m and 
denominator w 2 = yi ■ ■ ■ y n is given in (2.1). Again integrating over the distributions with the 
ratio constraint, we get 

P(z) =11 P{w! = x?) P{w 2 = yf) 8 ( — - z\ d Wl dw 2 

J0 J0 \ w 2 J 



1 



. — 1+m+n 



(In z y l+m+n U(n, m + n, -2 In z) for < z < 1 (2.20) 



T(m) 



/ 1 \ — l+m 

(m- l)!(n- 1)!2" v ; 
where U(a, b, z) is the Tricomi confluent hypergeometric function, defined by 

1 f°° 

U(a,b,z) = —- e- zt t a - 1 (l+t) b - a - 1 dt. (2.21) 
T(a) J 

We see again that the divergent behavior is nothing but the one from the numerator. Note 
that we have omitted the expression for 1 < z because the integration is difficult to be achieved 
analytically. 

Therefore we may conclude from these examples that the essential divergent behavior in the 
ratio distribution is dictated by that of the numerator. If the denominator itself has a divergent 
behavior, that tends to impact on z away from zero. The asymptotic behavior of the ratio 
distribution is summarized in Table [U 

Since the normal distribution has a finite non-zero value at Xi = 0, just like the uniform 
distribution case, the essential argument for the divergent behavior should be same. So far we 
have concentrated on just the uniform distributions for each random variable. This is because 
the expression for normal distribution is somewhat more complicated and cannot be achieved 
analytically in most cases. However, if we limit ourselves to particular cases, we can perform 
the integration even for the normal distribution cases. Here we show an example in the case 
for z = Xl "' Xn where Xi,y, obey normal distribution centered around with variance 1. Let us 
calculate the simple ratio distribution of W\ = X\jy\ first. This is obtained by 

P(Wl)= ! [ Pnormal(Xl)Pnormal(2/l)5( — ~ ™1 J = "r^-j. (2.22) 

Jo Jo \Vi J nl + w( 

Using this distribution, we can find out the distribution for z = Xl "' Xn as a product distribution 

yi"~yn 
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iteratively, by 

P(z) =/•••/ P{w 1 )---P{w n )8{w 1 ---w n -z) 

J^ZU (2k-iy. nti [(2* - 2) 2 tt 2 + (21n^) 2 ] for n = 2k, 

n?=i [(2A; - 3) 2 tt 2 + (21n^) 2 ] for n = 2k-\. 



7r 2fe-i( 1+2 2) ( 2 fc-2) 
/ 1 \ra— Ion 

z ~ o(_1) 2 r (inzr i + 



(2.23) 



ir n (n - 1)! 

Therefore we see again that the divergent behavior is nothing but that given in the numerator. 



2.3 Constrained product distribution 

If we have a constraint in the product distribution, the divergent behavior will be modified. 
Properties of a constrained system will come into play in the actual string example to be dis- 
cussed. Since a general analysis is quite complicated, we shall just illustrate some features of 
the constrained system with a simple example. 

Here we assume x±,x 2 are random variables obeying uniform distribution from to 1. Then 
consider a product distribution of z — x±x 2 , under a constraint c = x\jx 2 . We defined c as a 
constant, but which can have a range. Let us first calculate the probability distribution function 
at a particular (z, c) first, 

P(z, c) — dx\ / dx 2 5(xix 2 — z) 5 ( — — c 
Jo Jo 



x 2 



f 

Jo 



cz \ V c 



1 r 1 

= — for z < c < 1, or z < - < 1. 
2c - - ' - c - 

If c has a range for the constraint, the distribution for z is given after integrating over the 
region for c with suitable normalization. First we assume 1/2 < c < 1, for instance. Then the 
probability distribution function with respect to z becomes 

1 \ f 21n2 forO<z<i , 

z,-<c<l)=< . ~ 2 2.25 

Suppose the upper bound for c is bigger than unity, let us take 1/2 < c < 10 as an illustration; 
then the probability distribution function is given by 

f fin 20 for0<^<^, 
P(z,-<c<10 \ = \ flnf for^<^<i, (2.26) 
V 7 [ f lni for|<z<l. 
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As we see from above, generically the distribution does not diverge at z = 0, if c passes through 
neither zero nor infinity. Although the product distribution suggests a divergent behavior as 



discussed in Section 2.1.1 a constraint may work to erase the divergence in the product distri- 
bution. 

However if c passes through either zero or infinity, the divergence behavior is retained. When 
1/2 < c(< oo), the distribution goes as 

„ / 1 \ f | In? for < z < |, , _ 

P U < c = J 3 * - -2' 2.27 

V 2 ~ / \ | In \ for | < z < 1. V ; 

While, if < c < 1/2, we get 

P (z,0 < c < 1 J = 2 In 1 ioY§<z< X . (2.28) 

We see that the probability distribution function diverges as In z when c passes through either 
zero or infinity, or both. This generic feature of the constrained system is summarized in Table 

m 



2.4 Sum distribution 

So far we have explained what happens if the distribution is given as a product of some random 
parameters. Here, we shall elaborate on the sum distribution. This sum distribution is crucial 
if the population of vacuum consists of summation of a number of quantities, e.g. quantized 
fluxes considered in [l]. 

The sum distribution is understood rather easier than the product distribution. If there are 
two independent random parameters satisfying J" °° fi(xi) dx\ = 1, / °° f 2(^X2) dx2 = 1 (finite 
range for uniform distribution), the sum distribution of z = x% + X2 becomes 

/*oo ^00 

p i z ) = / fiixi) f 2 (x 2 ) 5(xx + x 2 - z) dx t dx 2 

./o Jo ^ 2 g) 
fi(z - x 2 ) f 2 (x 2 ) dx 2 . 



Let us apply this formula to a case which consists of a summation of the arbitrary powers 
of uniformly distributed random variables, where each term has the divergent behavior given 



by the distribution (2.12). The probability distribution of the m-summation of n^-powers of the 



random variables, z = x™ 1 + ■ ■ ■ + is given by 

,1 m -l+l/n 



jQ — S(wi + ■ ■ ■ + w m — z) dwi ■ ■ ■ dw m 

' i=1 Ui (2.30) 

= II£ir(i + i/ni) i /ni 
r(E^iVn l ) 
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Figure 3: The probability distribution of A in the Bousso-Polchinski type (3.1 1. The probability distribution 
in the J = 1 ( J = 10,20) case peaks at zero (~ 50,200). Population moves toward large A as the number of 
types of fluxes J increases. As a result, tiny cosmological constant is allowed but not preferred in the presence 
of more than one cycle. 



Thus we see that the singular behavior at Xj = for individual terms tend to be smoothed out at 
z = as the number of terms m in the summation increases. For instance, if n\ = n<i = 2, then 
summation of two terms are enough to suppress the divergent behavior. This smoothing behavior 
in the sum of product distributions also applies to the other distributions. The combined 
distributions including the sum distribution is summarized in Table [T] 



3 The Bousso-Polchinski example 

Let us move on to the population argument in the string landscape based on the flux quantization 
[Tj (see also a review (2|3]). In this toy model, the cosmological constant is given as a combination 
of bare constant Ab are and four-form fluxes contribution, by 



1 

A = Abarc + -5>^ 2 (3.1) 

1=1 

where g« are basic quantity assigned for each cycle after compactification, and are quantized 



integers related to fluxes of the cycles. A simple version of this model was constructed in 17,18 

First, for fixed qj ^ and Ab are = 0, we can read off from Table [TJ that the resulting 
distribution does not have a peaking preference for A = 0. Adding a negative Ab are will not 
introduce a peaking behavior at A = 0. It should be intuitively obvious that the situation 
would not improve in general. To see this more explicitly, let us impose randomness for the 
parameters A bare , n^, g» and see how it affects in the probability distribution of A. To be specific, 
we assume that the qi obey normal distributions centered around with variance 1, and A bare 
obeying uniform distribution from —100 to 0. Since the rij are quantized numbers, we give 
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random integer values for rij from to 10. Using these distributions, we plot the histograms of 
A at J = 1, 10, 20 respectively in Figure |3j 

The histogram illustrates our argument for the sum distribution: although the distributions 
of quadratic power itself for individual terms give us diverging behavior at A = point (notice 
that A = is not the origin of the histogram in Figure pi). Once there are more than one 



term in the sum in (3.1), the divergent behavior is being smoothed out. The preference of large 
cosmological constant is also an outcome of sum distribution, because all terms related fluxes 
are positively defined. But if we focus on J = 1 case, A = is mostly preferred among the 
population for A > owing to the divergent behavior of the product distribution for z = n\q\. 
However, as we expect a number of non-trivial cycles in Calabi-Yau compactifications easily, 
such the divergent behavior is quite likely smoothed out, and no preference for A = remains 



in this type of models. A similar plot was obtained for the J = 7 case in 19 



More generally, suppose the effective potential involves 2 or more components so that each 
involves only a subset of the moduli, e.g., 

V( ( f> i , ( p j )^V l (<f> i )+V 2 {<f j ) (3.2) 

where the two sets of moduli do not couple to each other. Even if the minimum of V\ is peaked 
at V\ — and the minimum of V 2 is peaked at V 2 = 0, it is possible that there is no peaking 
at the minimum at V — 0. So, for A = to be preferred, the sets of moduli should be coupled 
together. In general, if we include the interaction (loop contributions and a' corrections), all 
moduli should couple (at least indirectly) to each other. 

Since moduli stabilization requires some relations in all terms in the potential for both 
extremal conditions and stability constraints, we will argue how product distribution and its 
divergence behavior dominate the population in the stringy landscape while freezing randomness 
for summations, especially using a class of model in type IIB. It is possible that more coupled 
moduli in V will lead to a stronger peak at A = 0. This feature obviously depends on the 
detailed dynamics. 



4 The Kahler uplifting in the large volume scenario in 
Type IIB theory 

In this section, we like to study a specific example in string theory, in particular how the 
extremum and stability constraints impact on the basic idea of preference for small As. The 
formulae obtained in this section will be used for the probability distribution of A in Section [5] 
. Since supersymmetry can remain for negative A, their presence may mask the feature we are 
interested. So let us focus on the positive A case, and then comment on the negative A case. 
As we shall see, the case we study automatically extends to negative As. 
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There are some efforts to understand the potential for moduli stabilization in type II string 
theory. For necessity condition toward stabilization of all moduli, we may focus on two universal 



moduli consisting of a Kahler and dilaton. The extremum 120 ,21 at positive A and stability 22 



constraints are studied to limit models in string theory (see also an application for higher 
dimensional dS vacua [23]). 

To find the probability distribution of A, we have to start with an explicit relatively simple 
model that has a de-Sitter vacuum solution. Here we choose to illustrate the peaking behavior 
with a single modulus model with Kahler uplifting in the large volume scenario (LVS) in IIB 
string theory [4|^8] , focusing on positive A vacuum solution. We show how the peaking behavior 
of a product distribution appears when the potential has more than one term in it. We show 
also that the constraints typically modifies but not erase the peaking behavior. Let us briefly 
review the large volume scenario before focusing on the single modulus case. 

One may consider the KKLT scenario |!24| instead of LVS. KKLT introduces a non-perturbative 
term in the superpotential to stabilize at supersymmetric vacuum, therefore we may expect the 
tree level contribution and the non-perturbative contribution to be comparable: Wo ~ W np . 
Thus Wq is expected to be exponentially small. The stable positive A vacuum is achieved by 
adding some D3-branes at the tip of throat as an uplifting term |^] On the other hand, LVS 
allows the relation W ^> W np so that we can consider naturally large Wq after stabilizing the 
complex moduli. This relation gives us an advantage to make the potential relatively simple, 
especially at smaller A. 



4.1 The Kahler uplifting in the large volume scenario 



The large volume scenario is present in type IIB string theory. Owing to the no-scale structure, 
additional effects beyond the classical potentials are helpful to stabilize the Kahler moduli, while 
complex structure moduli can be stabilized at tree level. Here we consider an additional non- 



perturbative effect in the superpotential, originally introduced in 24 , and also a'-correction in 



the Kahler potential [36]. These two effects break the no-scale structure at next leading order 
in the potential, therefore Kahler moduli can now be stabilized. This fact enables us to impose 
a mass hierarchy between the Kahler moduli and the complex structure moduli so that we can 
safely assume that complex moduli are stabilized at higher scales. Thus we focus only on the the 
Kahler sector in this paper. More detailed analysis in the presence of stringy loop corrections 



were considered systematically in 37-39 . Some advantages for phenomenological applications 



of LVS in terms of mass scale hierarchies are available in 40 , 41 



The model is clarified to have a dS stable vacuum in a particular parameter region of minimal 
The backreaction of D3-branes has been recently discussed in 25 -32 . See also the argument of D6-branes 



33 35 . 
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configuration [5j|6 11 , satisfying the large volume assumption which is important to deal with 
the a'-correction suitably (note that a Kahler transformation in the system can increase the 
volume easily) A dS minimum is achieved because a term related to the a'-correction can 
contribute positively in the potential. Such a possibility was also analyzed along the Goldstino 
direction |43l. 



I Mr 



The effective potential for the Kahler moduli is given by 

3 




K = -2M 2 \n V + | , (4.1) 



£ = C(3) , /2 X (M) ~ 8.57 x 10- *^ 

4^(2^)3^ 9 r 

where x(^0 is Euler number of the manifold given by x{M) = 2(h 1 ' 1 — h 2 ' 1 ), which can be 
negative. Since we assume these complex moduli were already frozen at high scale, we consider 
£ as just a parameter here though £ has the dilaton dependence. It is worth noting that the 
V is a dimensionless volume measured in the a' unit. If we recover the dimensionality of the 
volume by redefining V = vol /a' 3 , we see immediately that the £ parameter is suppressed by 
a' 3 relative to vol, so it may be treated as an a' correction. Note also that the Planck scale is 
related to string length by Mp = AttV / (g 2 (27i) 2 a') where we consider V as given after moduli 
stabilization so that Mp is the constant. If we fix a' as motivated by the string theory setups, 
the Planck scale is defined differently depending on the V; that is, it depends on the vacuum 
expectation values of the moduli fields. However, since our focus is the cosmological constant 
observed in Planck unit, we shall consider the dimensionless ratio V/Mp, in which the Planck 
scale dependence is cancelled out. 



We consider the volume of this model as that of the Swiss-cheese type, defined by |5||6[ 40 

1 

"6 



V =i"*(«') 3 = Tim + Ti) 3/2 - Yl + T^ 2 , 



t=2 



, (4-2) 
U =ReT = d v iV, 7j = — j== 

where the self-intersection number Km are defined Km > while Km < (i > 1). The superpo- 
tential includes non-perturbative terms for the h 1,1 number of Kahler moduli: 

W =M 3 (w + Ae~ a ' Tl j . (4.3) 

The constants a« are given depending on the way of getting the non-perturbative terms: a« = 
6n/f3i by gaugino condensation on D7-branes, where /3j are the one-loop beta function coefficients 



See also 42 for the argument in the presence of a dilatonic non-perturbative term. 
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of the condensing gauge theory (/% = 3iV c for SU(N C )), and o, = 2ir by an instanton effect on 
Euclidean D3-branes, for instance. We may take aj = 2n/N c for integers N c > 1. The parameters 
W^o; A are functions of complex moduli and open string moduli. Here we treat them as constant 
parameters owing to the assumption that the complex moduli are stabilized at higher scale. 



4.2 The single modulus model 

Let us consider the single modulus case of the above scenario, with only the modulus T\. This 
case has been studied analytically in (6). It is easy to see that we have a minimum at the 
origin in the axionic direction of T 1; so we shall set ImTi = and consider only the real part 
t\ = ReTi. Here we set a' = 1 for simplicity. First, let us expand the potential as 



V V 



2 



X 



Mp W 2 




(4.4) 



+ ■ ■ 



If the potential is controlled by small factors like A\e aitl /Wo and £/V, we can focus on first 
two terms in the approximate potential |6], 



M P 2i\t\ 64v / 27 1 3 ^ 9 / 2 

W Q a\A x ( 2C e~ 



2 7l 2 \9x g / 2 x 2 
-21W,jaT 

KJ = , X = til tl . 

64^27^! 



(4.5) 



where we redefine several parameters. (Comparing to (1.2), we see that A = —Ax.) The 
existence of dS solutions requires 

- WqA y > (4.6) 
which we shall impose together with £ > 0. 



6 As we shall see, this —WoA\ > condition also allows AdS stable solutions, while Wo-^i > allows AdS 
but not dS stable solutions. So allowing WqA\ > will not change, up to an overall normalization, P(A) for 
A > 0, but will change P(A) for A < 0. 
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Figure 4: The plot shows behaviors of the bracket in ((45} (see §) at C ~ 3.55,3.65,3.75,3.85,3.95 from 
bottom to top respectively, for an illustration of the stability constraint. 

4.3 A for the stable solution 

Now both the extremal condition and the stability constraint can be solved analytically. The 
derivatives of the potential are given by 



Ox 

d x d x 



V 


W Q a\Ai 


Mj, 


27? 


V 


W a a\Ai 


M% 





X 



11/2 



+ 



x ' 



(4.7) 



The C and x are related through the extremal condition d x V = 0: 

C = x 5/2 (x + 2)e~ x . 



(4.i 



Combining with the semi-positive condition V > and the stability constraint d 2 V > 0, C and 
x should be within the ranges 



3.65 < C < 3.89, 2.50 < x < 3.11. 



(4.9) 



where the lower bound is for V > and the upper bound is for d 2 V > 

The stability of the potential V(x) (up to an overall factor) is shown in Figure 111 where we 



plot the behaviors of the bracket in (4.5). The constraints V > and d V > force the bracket 



in (4.5) to satisfy 



2(7 e - 
" 2l 9^-^ 6 - 25Xl0 " 4 - 



(4.10) 



This bracket in the potential (4.5) can reach negative values if we allow x < 5/2. 



Next we consider the minimum of the potential (4.5), especially at the vicinity of A = 
Vmin = 0, where C ~ 3.65, x = 5/2. By using the extremal constraint (4.9), we can expand the 
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minimum of the potential (4.5) around zero cosmological constant: 

A e" 5 / 2 (2\ 2 -W a\A x 



M% 9 V 5 / 7i 
1 /2\ 9/2 -W^a\A x 



x 



9 \5J 7 2 

5/2 




C-^k +••> (4-11) 



C~(§) e- 5 / 2 (x + 2) + 



For generic values of the parameters 71, a x , A\, Wq and £, A/Mp may be small, say, of order of 
— 10~ 4 . One can always tune x (and C) so that A/Mp ~ 10 -120 , but this is fine tuning. 

Before going to the random variable analysis, we would like to comment on some features. 
If we focus on the non-perturbative term from SU(N C ) gaugino condensation on D7-branes, 
we have a\ = 2n/N c . From the stability constraint for de-Sitter vacua (4.9), we may roughly 
estimate C ~ 4, x ~ 3. Therefore the magnitude of compactified volume is roughly given by, 

3 / 3 \ 3 / 2 

h~—N c , V~7ihH N c 2 - ( 4 - 12 ) 



2vr ' V2vry 

Thus the volume is easily getting larger as N c increases. Solving A\ (using C ~ 4) in the 
potential (4.11), we obtain 

, _ 9/2 OV n I / •:• \ " 2 W2 

M 7 ' ~ 



^ ~ - ( 2 -) 9/2 ^1 ( aV /2 "f.v - „ 4 . 82 x 1() v»l^ A . (4 . 13) 

i4 9 V5; 64V/2X4V27T; 7? 7i 



Therefore a large A^c yields a small cosmological constant. However, just taking N c very large 
is a fine-tuning in the model. Instead of fine-tuning, we randomize the parameters in several 
ways, and analyze the statistical behavior in this model. 



5 The probability distribution of A 

The question we are finally ready to address is: 

Without fine tuning, what is the preferred values of A, if the parameters in the model 
are treated as random variables ? 

Here, our discussion is restricted to the single modulus model just reviewed. 
5.1 Probablity distributions of Wo and A\ 

For the stringy mechanism to work in this model, it is crucial that the parameters Wq and A% 
can take zero values. So let us make some remarks here about zeros of Wq and Ai, which depend 
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on the complex structure, the dilaton, positions of D3-branes, etc.. In general, it is simplest and 
probably most natural to have Wo — A\ — 0, yielding supersymmetric Minkowski solutions. So 
we do expect zeros for them to be very likely. Since other values are also possible, they will have 
some non-trivial probability distributions, but these distributions should naturally peak at zero 
values. 
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Though Wo is different in different models, we focus on a simple example of T 6 [5}|6 
After solving the equations for supersymmetric minima, we get 

W = 2du(l-s) (5.1) 

where du is a combination of two parameters involving the expectation values of the real parts 
Ui of the complex structure moduli Ui, and s is the expectation value of the real part of the 
dilaton |6|. As we see immediately, the Wq can pass through zero easily, as a result of s = 1 or 
the combined parameter du = 0. Even in this simple example, we see that Wo = is allowed in 
general. Furthermore, because of the allowed zeros in du and 1 — s, we expect the probability 
distribution P(W Q ) of W to be peaked at W = 0. 



On the other hand, the dependence of the position of D3-branes in A\ was discussed in 45 
The resulting dependence goes like 

A 1 =A 1 (U i )(f(X i )) 1/n (5.2) 

where Ai(Ui) is a function of the complex structure moduli Ui and /(Aj) specifies the positions 
of D3-branes. For instance, in the conifold, we get 

4 
8=1 

where Pi G Z, q = YlPi> an d A* specifies the location of the D7-branes in the throat (note that 



we used the different notation from 45 to avoid confusions in the other sections). The case 
f{Xi) = is known as Ganor zero, which is the special case when 3-branes dependence on the 
superpotential turns out to be zero [46] . So the A\ parameter itself naturally passes through 
zero in string compactification. Since we may expect allowed zeros in A\ also, the probability 
distribution P{Ai) of A\ may peak at A\ = 0. For simplicity, we shall assume in the discussion 
below that both Wq and A\ have uniform distributions that include zero. This is a rather 
conservative input. 



5.2 The distribution of A 

Let us define 



(ilv'2 



w i = ~57?> w 2 = A 1 , w 3 = -Wo, w 4 = 4, c= = — — C. (5.4) 

j*> 6 w 2 27 
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so A (4.11) can be rewritten in terms of the dimensionless A, 



t A 3 3 , , 200^ 100 ( . 

A = — r ~ —W1W2W3 (c — Cn) , c = ~ 12.2. (5.5) 

Mj, 2500^5 11 ' 3e 5 /2 

Neglecting the overall constant factor in A, we reduce the problem down to the constrained 
product distribution given by 

3 /2 

A OC W 1 W2W3[C — Co), c= , Co < C < Ci, (5.6) 

w 2 



where Co ~ 12.2 as given in (4.9) for the potential (4.5). Since here we restrict ourselves in the 
region for smaller cosmological constant, C\ cannot be so large, but we allow c to have some 
range. The value of C\ fixes the coefficient of the most divergent term, and is irrelevant for the 



divergent behavior itself as long as c± is finite. Stability condition (4.9) requires that c\ < 13.0 



Now we start imposing randomness in the constrained product model (5.6). Although we 



can deal with iui, iu 2 , u>3 as random parameters passing through zero, there is a concern about 



u> 4 = £. The definition of £ include dilaton dependence as in (4.1), therefore the parameter 
may be treated as random parameter in a range of finite values except for zero. This is because 
zero-ness of £ is given as a result of zero value of Euler number, therefore it is difficult to expect 
that £ smoothly passes through zero. If there is a parameter in a product which does not pass 
through zero, that parameter is irrelevant for the divergent behavior of the product at A = 0. 
Thus we neglect the dependence of for the moment. We shall include 104 back in Case 1 in 
Appendix [Aj 

We may further consider the possibility that the cti parameter has a lower cutoff, as a result 
of the size of the gauge group which contributes to the gaugino condensation on D7-branes. If 
the parameter ai = is excluded, a\ (or w{) cannot contribute to the divergent behavior in 



the constrained product distribution in (5.6). Instead of treating W\ as a random parameter, 
we consider SU(N C ) gaugino condensation on D7-branes to generate the non-perturbative terms 
with fixed N c and set 71 = 1 so that W\ = 2tt/N c for simplicity. We shall treat a\ (or w\) as a 
random parameter (in 2 different ways) in Case 2 and Case 3 in the Appendix [Aj 

This means that there are a number of possibilities that we should consider. We study 
plausible scenarios and summarize the results in Table [2j Below we discuss the simplest scenario 
and relegate the other three scenarios to Appendix |X} We see that each scenario yields a peaking 
behavior (at zero) for the probability distribution of A, though the specific divergent behavior 
differs under different assumptions. 
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Model 


Random variables 


Asymptote 


V x 1 




5.7 




7/?o 7/Jo 


- In A 


1 09 x \Q~ 5 f(2ir/N ) 9 / 2 


( 


A.2 


) 


w 2 ,w 3 , W 4 


-In A 


7.72 x 10- 6 (2tt/AT c ) 9 / 2 


( 


A.5 


) 




A~ 4 /9 


5.09 x 10" 6 £ 


( 


A.9 


) 


1/Vm <Wi= 1/2/1 < 1,102, w 3 




2.50 x 10" 6 £ (y m -» 00) 



Table 2: A summary of the peaking property in the constrained product distribution (5.6 1 for the 4 plausible 
scenarios. The second column shows the parameters that are being treated as random variables. We classify the 

asymptote of probability distribution P(A) and the square root of the variance \ j ( A 2 > for each set of random 



variables among wi, W2, 11)3, 104. They have uniform distribution except for w\ in the last case (A.9) where 
Hi = l/w\ has a uniform distribution. 



5.3 A simple scenario for the parameters in V 

With the above simplifying assumptions, A = 3^/2500a/5 where 



2tt 



w 2 w 2 (c- c ), c 



* f2n\ 3/2 w 3 
\N C J w 2 



(5.7) 



That is, we treat only w 2 = A\ and w% = —Wq as random variables while the other quantities 
are fixed parameters. Performing the conditioned integrals, we get 



P(z, c) = J dw 2 dws 5 

'N c \ 3 1 



2tt 
Ac 



w 2 w 3 (c - c ) - z 



)'('(S"5-) 




for < z < r 1 if 

2c(c-co) — — s \N C 



3/2 



2c(c— co) 



for 0<*<c(f J c>e 



9/2 



e(c-co), c<£y Nt 
3/2 



2 71 



3/2 



(5.f 



Since the region of our interest is given by c > Co ~ 12.2 > £(27r /A c ) 3 / 2 (assuming £ < 1, 
A c > 1), we can perform the integration with respect to c (see the left hand side of Figure [5] for 
an illustration): 



P(z) =A - 1 



Cl 



dcP(z, c) 



1-z£- 1 (N c /2ti) 9 / 2 



: 1 (Nc 
2tt 



9/2 



In 



0<2;<£ 



/27ry /2 , 

U) 6l> 



A n 



dz 



dcPiz, c) 



l-*f- 



L(JV c /2tt) 9 / 2 



* \N C ) 2c ( 



(5.9) 



Ci — c 
01 = > 

Cl 
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where Nq is the normalization constant. If we input the value obtained in (4.9) for c\ ~ 13.0, 
we have b\ ~ 0.0610. 



If we recover the coefficient appearing in (5.5), the probability distribution for A can be read 
off by using a relation: 

p(k) d(k) =P(z)dz. (5.10) 



(5.11) 



Now the product distribution for A is written by 

9/2 



p(A , 2500^5 r 3 /2.y/^ 

V I 3f>! s \2kJ 2500V5 5 \N C ) A 



So we see that the probability distribution for A peaks at A = with a divergent behavior 
lnl/A. 

Note that we have ignored that A can be negative. The probability for negative A is actually 
larger than that for positive A. It is easy to extend the calculation to include it. However, we 
assume that the universe starts in the inflationary epoch with a relatively large vacuum energy 
density so it will reach the small positive A region before the negative region. This allows us to 
simply focus on the peaking behavior of -P(A). 



Next we consider the expectation value of A, using the distribution (5.11). The value for 
positive A is given by 



A 



,2500^5 J H N c 



d A APIA 



2500^ \N t 



(5.12) 



Plugging the values in (4.9), we get 



~ /2vr\ 9/2 
A > -8.19 x 10- 6 £ f-j 



(5.13) 



where £ in (4.1) suggests that £ ~ 10 3 to 10 is not unreasonable. We can also calculate the 



variance of this system by 



A 1 



2500^ 



)>(1)1 



1.19 x 10~ 10 £ 2 



2tt 



(5.14) 



Although the ratio 2tt/N c might make the quantities much smaller as a fine-tuning, we can 
conclude that the expectation value (5.13) has the typical scale 10~ 5 which is the outcome 



of moduli stabilization (dynamics) of the potential. It is interesting that the value obtained 



here is close to the constrained value of the bracket in the potential (4.10) though we take into 



account the probability distribution of each parameters. This is because the model we considered 
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include just two random variables as inputs. Once there appear multiple variables product in 
the coefficient of the entire potential, we can easily expect that such the many random variables 
contribute to make the cosmological constant smaller. 



Since we linearize the function for A(c) around the vicinity of A = to (c — Cq), -P(A) (5.11 ) 
is a good approximation for relatively small A. Note that here we apply the upper bound c\ 
given by (4.9) for convenience. If we compare the potential (4.5) and the approximated potential 
(4.11), the difference in A away from zero may differ a bit, but will not change the essential 
behavior around zero. 



5.4 A measure on the suppression of A 



For our application to the A problem in string theory, we like to introduce a measure on how 
much the moduli stabilization dynamics will suppress the value of A. For A = Tlf =1 x™\ let 



Q= 7>g 



(A 2 ) 



„2 \ n N 1 



(5.15) 



where Q measures the order of magnitude the average magnitude of A is suppressed relative 
to its naive expectation, and we use log with base 10. We consider \/Jxf) instead of (x.;) so 
that the divergent behavior of negative cosmological constant toward A = is also taken into 
account to argue the preference of small magnitude of A. Q For example, (2.7) yields Q = 
for A = x±X2 ■ ■ ■ x n where Xi are independent random variables. In the absence of moduli 
stabilization, Q < generically. In the presence of moduli stabilization, A is meaningfully given 
only at the meta-stable minimum of the effective potential. Both the extremum and stability 
conditions introduce constraints., which can enhance the value of Q. 



Applying to the problem at hand, we have, referring to (5.5) 

Q = z ]og K) 3 K 2 )K 2 )((c-c ) 2 ) 



(5.16) 



(wfwjwj(c- c ) 2 ) 

where the expectation values in the numerator are evaluated before imposing the constraints, 
while that in the denominator is evaluated after imposing the constraints. 



Let us consider the potential (4.5) before solving for V m \ n . Apriori, the bracket involving C 



and x can take any value without bound. However, the approximation for a valid V probably 
breaks down when V/Mp exceeds unity. So let us simply assign the naive expected value to be 



of order unity. On the other hand, we find that the actual expected value is given in (5.14) 



A 2 ) ~ 1.09 x 10" 5 £ 



(5.17) 



7 Here we consider only the vicinity of A = 0. In this region, we may expect that the probability distribution 
of negative A has approximately same behavior as that of positive A, even though there is a concern about BF 
bound, above which some tachyonic modes are allowed. 
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If there are no constraints for both extrema and stability, the potential value is expected of order 
of Planck scale because the system we consider here breaks down above Planck scale. Then the 
numerator in the log function of (5.16) is expected to be about 1 in Planck unit. Thus we see 
that Q ~ 4.96 — log£ + 9/2 log(iV c /27r), which means that the string dynamics in the single 
modulus case can easily reduce A by five orders of magnitude or more. 

Although so far we consider up to only four random parameters, these random parameters 
themselves may be non-trivial functions of the other stabilized moduli in flux compactification. 
Following the argument around (2.17), we see that additional random parameters (rather than 
additional powers of the same random parameters) may suppress the expected value (5.17) 
further. 



6 Discussions and Remarks 

In this paper, we study a single modulus Kahler uplifting model in the large volume scenario 
within Type IIB flux compactification. We study the probability distribution of classically 
stabilized vacua and show the emergence of the peaking (at zero) behavior for a small A |^J 
However, in the absence of fine tuning, the smallness of the expected A is still far too big to 
explain the observed dark energy. On the other hand, we see that the above estimates of A 
assume that the parameters entering V (such as Wq and A\ = —A in the superpotential W) 
have uniform or normal distributions. This is a very conservative estimate. 

It is simplest and natural to have Wq = A\ = 0, yielding supersymmetric Minkowski solu- 
tions. So we do expect zeros for them to be very likely. Since other values are also possible, 
as we have pointed out, we believe that their probability distributions naturally peak at zero 
values. In actual cases, they are functions of the stabilized heavy modes, so they themselves 



can be products of other random variables. In the simple examples considered for Wq (5.1) and 



A\ (5.2), we see that this is entirely plausible. That is, by the time they enter into V, their 
probability distributions can be very peaked (at zero) already. This will in general lead to a 
much more peaked behavior (at zero) in the probability distribution of A. In general, we can 
check this key point by considering multi-moduli models and stabilize the moduli one by one 
and check the behavior of the probability distributions of the output parameters. 

It remains to be seen whether the above mechanism is one of the main reasons underlying 
the very small observed A. If that is the case, the smallness of WqA in ( 1.2[ ) (or a similar factor 



in a more realistic multi-moduli potential) renders the potential to be very small and almost 



A power distribution was applied in a priori distribution for the cosmological constant 47 



in the context of 



anthropic principle 48 . However since a priori distribution was considered to come from the inflationary model, 



a priori distribution of the vacuum energy P(V(<f>)) cx 1/|T^'(0)| is mostly constant due to slow-roll conditions 



for inflation 49-51 . 
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flat around the minimum of V, so it will also render the modulus mass to be exponentially 
small. When the modulus mass is light enough, we may worry about the cosmological moduli 
problem [9 10 . The cosmological moduli problem is absent when m tl > O(10) TeV. If we 

naively estimate the physical modulus mass to be m tl ~ >J (j^jMp 9 , we can safely apply the 

mechanism of product distribution for smaller cosmological constant down to A > 10~ 30 , which 
is still far from the observed value of 10~ 123 . It remains to be seen whether the situation will 
improve in a more realistic multi-moduli model. 

In the above single modulus model, it is easy for us to see that the stability condition is 
not too restrictive. That is, among the cases where the extremum condition is satisfied, a 
reasonable fraction V (about a few percent) of the extremum solutions also satisfies the stability 
condition. As we introduce more moduli into the scenario, we expect that this ratio V will 
become Gaussianly suppressed 52 54] (see also [55]). This is simply a result of (mass squared) 
eigenvalue repulsion of the Hessian matrix didjV of the potential V. So the following picture 
emerges. At relatively high vacuum energy densities, there are very few de-Sitter vacua (if any at 
all) in some regions of the landscape (the regions under analysis), because the number of stable 
vacua is Gaussianly suppressed. At lower energies, the number of low scale moduli involved 
decreases, so the probability of stability solutions is no longer Gaussianly suppressed. This 
multi-moduli picture is consistent with the single modulus analysis carried out in this paper; 
how these two pictures merge together will be the challenge for the future. 
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A Other scenarios for the parameters in V of the single 
modulus model 

In this appendix, we consider three more plausible scenarios for the probability distribution of 



A (5.6) 



The physical modulus mass is estimated to be m\ ~ K TT d\V = M%,4x 2 /3d%(V/M£). Since we expect 
~ 3 and (d%(V/Mp)) ~ ( A) in stable positive vacua, we roughly estimate m tl ~ W (A)Mp. 
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Case 1 Although we neglected the randomness of £ in (5.7) because of its discrete behavior 



around zero value, it is possible that we can treat this parameter as a continuous variable passing 
through zero after including stringy loop corrections in the Kahler potential. The stringy loop 



corrections in the Kahler potential are studied in 37 39 56 57] (see also supergravity loop 
correction in [58] ). The typical effect of stringy loop correction can be written down [39], 



K 



9 a K^ K) 9s 
V 



(W) 



V 



(A.l) 



where we expand the Kahler potential (4.1) up to leading order of £ The terms related to 



are from the KK-modes, while terms with Sq W ^ originate from the exchange of winding 



<->a 

strings. They are functions of complex structure moduli, and therefore given as constants 
after stabilization. The g a K is proportional to some two-cycle volume, while the is inversely 
proportional to some two-cycle volume. 

Though these stringy loop corrections depend on the Kahler moduli, they are sub-leading 
terms in general and will not change the stabilization qualitatively under the assumption of large 
volume. If we deal with the string loop corrections as effective constants as well as £ parameter 
for simplicity, we may treat the combined parameter among a! and stringy loop corrections 
appearing in Kahler potential as a random variable passing through zero value. 

Neglecting w\ and recovering dependence, the system of interest now becomes, 

3/2 
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w 2 w 3 (c-c ), 



2tt 



w 2 



C < C < Ci- 



(A.2) 



where Wj have uniform distributions < Wj < 1. Performing integral with constraints in the 



same way as that of (5.9) (the valid region is illustrated in the left hand side of Figure^]), we 



get, after some calculations, 



P(z) 



2ir J 



9/2 



2tt 



9/2 



+ In 



'2tt 



9/2 



1 



< z < 



2tt 



9/2 



h, (A.3) 



where b\ = (c± — cq)/ci. Using (5.10), we can convert to -P(A) (A = A/Mp). The expectation 

2tt\ 9/2 



value and the variance in this case become 

3 (2v\ 9/2 b v 



A 



A 5 



2500^ \N C J 6 



5.46 x 10 



-6 



NJ 



2500^ 



2tt\ 9 b\ 



5.96 x 10 



-li 



2tt 



(A.4) 



for positive A. 
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The ^ parameter itself is also affected by the loop correction to be £ — > £(1 + n g^/3C(3)) (see review 59 ) 



but we neglect this effect here because this correction doesn't help to have smooth behavior around £ = 0. 
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Figure 5: The regions that contribute to P(z,c) in the the c-z plane. The left plot shows the valid region in 



the cases of (5.7), (A. 2), and (A. 5 1, where the region is surrounded by c = c\ and c = c /(l — z). The right plot 
is for (A. 9), where the middle curve c = 1/(1 — ]fmZ) which separates two regions corresponding each function 
in (A.10). Here we assume £ = 1, a\ = l,7i = 1 in the figures. 



Case 2 Next, let us consider the model without taking £ (= w^) random, but recovering a\ (w\) 
as a random variable. The system of our interest now becomes 



w 1 w 2 w 3 (c - c ) 



W 2 



C < C < C\. 



(A.5) 



with m = 3,n = 3/2. Let us first derive the analytic formula with arbitral m,n satisfying 
m > n. We assume that wi,W2,ws obey uniform distribution from to 1 just for simplicity. 
The product distribution for fixed z, c are given by an integration: 



P(z,c) 
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for z < ,^ < c. 



(A.6) 



The remaining task is to obtain the probability distribution -P(z) after integrating over c in the 
appropriate regions. Now let us use the actual numbers of interest to us here: m = 3,n = 
3/2, c = 200V5e- 5 / 2 /3 ~ 12.2 > f. The appropriate region in the c-z plane is illustrated in the 
left hand side of Figure |5j The probability distribution function in this case now becomes 

1 



P(z) =N ~ 1 
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(A.7) 
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where No is the normalization constant. The P(z) for the system (A. 5) is diverging as z 4 / 9 . In 
the absence of the constraint, Table [l] tells us that P(z) 



~ z 



-1+1/3 



-2/3 



, so we see that the 

constraint reduces the divergent power by 2/9. Using this formula (A.7), we can calculate the 
expectation value and variance of A, 



A 



A^ 



2500^ 56 



~ 2.34 x 10~% 



2 5bl 



(A.8) 



2500^/ 207 



2.59 x 10~ n | 2 . 



Since we treated w\ = ai/7 2 as a random parameter here, all possible values for w\ are taken 
into account without fine-tuning. 



Case 3 In the previous case, we assume a uniform distribution for W\. If we consider only the 
SU(N C ) gaugino condensation on D7-branes and assume randomness for N c of a\ = 2n/N c with 
fixed 71, we may expect that probability distribution is more peaked toward A = 0. Here for 
simplicity, we assume uniform distribution from to 1 for w 2 , w 3 , and uniform distribution from 
1 to y m oc N™ ax for y\ = 1/wi, and consider the following constrained product distribution: 
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The distribution for this system can be calculated 
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Now we integrate over c for the appropriate regions illustrated in the right hand side of Figure 
[5j The resulting probability distribution of z can be written by, after some calculations, 



P(z) 



r 1 
r 1 



I'Jn 



64&i( v /y m -l) 



/.</,, 



64( v ^-l)^" 1 2 



-361ny m + (y^-l) (^9 + 81n 
9(^) 1/9 -^( 9 + 81n ^ 



forO<z<ei, 



for < z < C&i- 



(A.11) 



Applying this distribution to obtain the expectation values, we have 
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where we also give the numerical values at y m — > oo. 
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